"A method of purifying DNA"



Description 

The present invention relates to a method of purifying a set of specific DNA 
molecules to be used in DNA-DNA hybridisations, as well as to DNA probes 
containing less than 2% Cot-1 DNA. 

Molecular genetics is the study of nucleic acids and their role in the biology
of the cell. At the core of this science is the technique of Southern blotting,
which involves the hybridisation of DNA in solution to DNA immobilised on a
solid membrane. One relatively new branch of molecular genetics, molecular
cytogenetics, deals with the molecular biology of the chromosome level of
organisation as opposed to the DNA level. Molecular cytogenetics is a steadily
expanding field whose broad implications for the study of human and other
genomes have not yet been fully explored. Examples of the kind of experiments
carried out within this field are given below.

Fluorescence in situ hybridisation (FISH) has been applied with a multitude of
probes of different complexity for chromosome painting (Lichter et al., 1988;
Pinkel et al., 1988) and chromosome bar coding (Lengauer et al., 1993) and has
provided the most direct and rapid way to map the chromosomal localisation of
DNA sequences (Lichter et al., 1990). FISH to extended chromatin fibres and
single DNA strands has brought the mapping resolution down to the kilobase
(kb) range, offering powerful new possibilities for the generation of high
resolution physical maps (Florijn et al., 1995; Weier et al., 1995).

Multicolour FISH approaches taking advantage of the combinatorial use of six
fluorochromes have made it possible to distinguish each of the 24 human
chromosomes by a different colour (Speicher et al., 1996; Schrock et al.,
1996). FISH to extended DNA molecules has brought down the mapping resolution
of this approach to the kb range, offering powerful new possibilities for the
generation of high resolution physical maps (Weier et al., 1995; Florijn et
al., 1995).

Interphase FISH ("interphase cytogenetics"; Cremer et al., 1986) has allowed 
the study of numerical and structural chromosome aberrations directly in the cell 
nucleus. 

Comparative genomic hybridisation (CGH; Kallioniemi et al., 1992; du Manoir et
al., 1993; Joos et al., 1993) has provided a powerful tool to detect non-random
gains and losses of DNA sequences in genomic DNA (obtained, for example,
from tumour specimens).

Procedures for the quantitative and automated evaluation of FISH experiments 
have been developed in parallel and hold the promise for fully automated optical 
mapping approaches in the future. 

Present diagnostic and research applications range from prenatal diagnosis to 
postnatal clinical cytogenetics, from studies of genetic changes in cancer (du
Manoir, 1995; Piper et al., 1995) to biological dosimetry (Cremer et al., 1990;
Lucas et al., 1992), from comparative chromosome mapping (Wienberg et al., 
1990; 1995) to studies of the 3D-organisation of genomes in situ (Manuelidis, 
1990; Cremer et al., 1993). 

Chromosome-specific cDNA and other libraries, as well as the subregional probes
that are presently available (for example microdissection probes, YACs, BACs,
PACs and cosmids), often do not optimally serve the needs of molecular
cytogenetics applications. Several examples may serve to demonstrate the need
for improved probes:

- All probes presently available for Southern blotting and FISH which are
  derived, for example, from cosmid, BAC, PAC and YAC libraries contain
  interspersed repetitive sequences. This presents a problem when probes
  derived from genomic DNA are hybridised to DNA on Southern blots or to
  chromosomes/nuclei in situ, since the interspersed repetitive sequences
  present in the probe DNA hybridise throughout the target DNA, i.e. to all
  of the DNA present on the Southern blot or within all chromosomes/nuclei in
  situ. Thus the hybridisation to the true target sequence is obscured by
  background hybridisation everywhere else. To prevent non-specific
  hybridisation between interspersed repeat sequences of probe and target, an
  excess of unlabelled competitor DNA is usually included in the hybridisation
  mix as "blocking" agent. Southern blotting techniques usually use total
  human DNA for this purpose (Sealey et al., 1985), whereas FISH usually uses
  Cot-1 DNA (Pinkel et al., 1988; Lichter et al., 1988). Cot-1 DNA is highly
  enriched for sequences present in more than 10^4 copies per haploid genome.
  However, the routine inclusion of commercial sources of Cot-1 DNA in
  hybridisation mixtures in excess quantities is expensive. Moreover, a certain
  fraction of labelled, interspersed sequences will hybridise to target
  sequences even in the presence of excess Cot-1 DNA and lower the signal to
  background ratio of hybridisation signals. Thus there is an urgent need for
  the development of improved probes which entirely lack the repetitive
  sequences that are shared with other chromosomes and thus impair the
  specificity of the probes.

- In studies employing CGH to chromosomes or to DNA microarrays (Kallioniemi
  et al., 1992; du Manoir et al., 1993; Schena et al., 1995; Shalon et
  al., 1996), representational difference analysis (RDA; Lisitsyn et al.,
  1993) or genomic mismatch scanning (Nelson et al., 1993), it would be
  clearly advantageous if the DNA used for such studies did not comprise
  the entire complexity of a large genome, but a representative sample
  highly enriched in single copy or coding sequences.

- In multicolour FISH studies employing combinatorial probe labelling
  (Speicher et al., 1996; Schrock et al., 1996), it would be advantageous if
  probe sets had no repetitive sequences. Usually, so many different
  probes have to be hybridised with a correspondingly large amount of
  Cot-1 DNA, thus making hybridisations expensive, bulky and liable
  to have low signal to background ratios.



- In FISH studies of chromosome evolution, probes representing entire
  genomes, as well as chromosomes or chromosomal subregions, need to be
  enriched for sequences conserved between two species of interest to
  define more readily evolutionarily conserved segments along chromosomes,
  as well as evolutionary chromosomal rearrangements in species belonging
  to a given class or even to different classes.


- In studies of 3D in situ human genome organisation, chromosome- and
  chromosome region-specific paint probes containing specific subsets of
  sequences would be highly useful, such as complementary sets comprising
  coding sequences vs. non-coding sequences, or "scaffold"-attached
  sequences vs. non-attached sequences.

If a DNA probe is generated by PCR or is present within a vector, the knowledge
of whose sequence facilitates PCR amplification, then the probe can be further
amplified using the existing primers. Some complex probe sets are amplified
using a universal PCR amplification protocol. This means that when the probe
set is first selected, usually from an amount of DNA corresponding to a small
number of nuclei, it is amplified in a way that maximises amplification of all
DNA fragments. There are at least two ways of doing this: DOP-PCR and
linker-adapter PCR.


Since 1992, universal DNA amplification procedures have been introduced that
allow the amplification of any DNA source employing primers which contain a
stretch of random bases and another stretch with a specific DNA sequence
(Telenius et al., 1992; Bohlander, 1992). The method described by Telenius et
al. (1992), termed degenerate oligonucleotide-primed (DOP)-PCR, is well
established in our laboratories, with the same oligonucleotide ("6MW", 5'-CCG
ACT CGA GNN NNN NAT GTG G-3') and conditions as Telenius et al. (1992). During
the first five PCR cycles, which are performed under conditions of lower
stringency (i.e. a low annealing temperature), the primer part comprising the
random (N6) sequence can hybridise to many sites of any complex DNA source.
Subsequent cycles performed under stringent conditions (i.e. a higher annealing
temperature) should, in theory, allow the specific, further amplification of
those DNA fragments in which the specific primer sequence has been incorporated
during the first amplification cycles.

WO 98/15649 PCT/EP97/04199
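
The structure of the "6MW" primer quoted above can be made concrete with a short sketch. It splits the primer into its specific 5' part, its degenerate (N6) part and its specific 3' part, and enumerates every concrete oligonucleotide the degenerate notation stands for. The function names are illustrative only and are not part of the described method; the sketch assumes that each N position can be any of A, C, G or T.

```python
from itertools import product

# DOP primer ("6MW") as quoted in the text; N marks a degenerate position.
DOP_PRIMER = "CCGACTCGAGNNNNNNATGTGG"


def split_primer(primer: str):
    """Return the (5' specific, degenerate, 3' specific) parts of the primer."""
    first_n = primer.index("N")
    last_n = primer.rindex("N")
    return primer[:first_n], primer[first_n:last_n + 1], primer[last_n + 1:]


def expand_degenerate(primer: str, alphabet: str = "ACGT"):
    """Yield every concrete sequence encoded by a primer containing N positions."""
    n_positions = [i for i, base in enumerate(primer) if base == "N"]
    for combo in product(alphabet, repeat=len(n_positions)):
        seq = list(primer)
        for pos, base in zip(n_positions, combo):
            seq[pos] = base
        yield "".join(seq)


five_prime, degenerate, three_prime = split_primer(DOP_PRIMER)
variants = list(expand_degenerate(DOP_PRIMER))
print(five_prime, len(degenerate), three_prime, len(variants))
```

Six degenerate positions give 4^6 = 4096 distinct oligonucleotides, which is why the primer can anneal at many sites under low-stringency conditions, while the ten specific 5' nucleotides provide the defined sequence used in the later, stringent cycles.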

Linker-adapter PCR strategies involve the generation of specific target
sequences at the end of DNA fragments which hybridise to unique primer pairs.
This approach allows the amplification and reamplification of complex sources
of DNA such as DNA from microdissected chromosomes, chromosome arms and
chromosome bands (Ludecke et al., 1989; Vooijs et al., 1993).



Subtractive hybridisation involves the hybridisation of two sets of DNA
molecules, the source DNA and the subtractor DNA. Usually, the contents of
these two DNA sets overlap. The relative amounts of source and subtractor DNA
can be altered to drive the hybridisation kinetics to favour either positive or
negative selection. Positive selection, in which the number of source molecules
is usually greater, would involve isolation of subtractor-homologous source
DNA. Negative selection, in which the number of subtractor molecules is
greater, would involve isolation of subtractor-nonhomologous source DNA. In
addition, the selected DNA can be put through one or more further rounds of
selection to improve the selection process. Subtractive hybridisation was first
used to clone the differences between two overlapping pools of cDNAs;
double-stranded hybrid molecules were separated from free single-stranded cDNAs
by chromatography through hydroxylapatite (Kurtz and Feigelson, 1977; Affara
and Daubas, 1979; Timberlake, 1980). This enabled only negative selection. With
the substitution of biotin-labelling of the subtractor DNA followed by
avidin-affinity capture and release techniques, it became possible to both
positively and negatively select subsets of source DNA (Welcher et al., 1986).
This technique was further improved by the addition of a post-selection
inter-Alu-PCR amplification step (Rounds et al., 1995).
The concept of removing repetitive sequences from chromosome-specific libraries
employed for the FISH-visualisation of entire chromosomes and chromosomal
subregions by affinity chromatography procedures and, alternatively, of
suppressing the undesired hybridisation of hapten-labelled repetitive probe
sequences with Cot-DNA fractions as blocking agents was first tested by T.
Cremer and P. Lichter when they worked together with David C. Ward at the Dept.
of Genetics, Yale University (1986-1988). Copper chelation affinity
chromatography, a well known procedure for the purification of proteins, had
already been modified there for use in the enrichment of specific DNA sequences
from complex DNA sources (Welcher et al., 1986).



The following strategy to produce a human chromosome 21 "paint" probe library
depleted in interspersed repetitive sequences, as well as other repetitive
sequences cross-hybridising to other acrocentric chromosomes, was tested at
that time.

In this strategy, the source DNA is represented by a chromosome 21 paint probe
and the subtractor by biotinylated and 3H-labelled sequences representing
flow-sorted chromosomes 13 and 18, and cloned repetitive sequences. These
subtractor sequences were used in excess (50-fold) and mixed with chromosome 21
source sequences end-labelled with 32P. The radioactive labelling was performed
as a simple means to follow the presence of subtractor and source sequences and
the intended fractionation of the latter. Following hybridisation of subtractor
and source DNA in suspension, and incubation with avidin, the hybridisation
mixture was subjected to copper chelation chromatography. The affinity of the
columns for avidin was used with the intention of removing all biotinylated
subtractor DNA sequences and subtractor DNA-source DNA hybrids (3H labelling
was used to monitor the bound fraction and possible leakage of the column). The
expected enrichment of chromosome 21-specific, 32P-labelled source sequences in
the flow-through fraction (negative selection) was monitored by a filter
hybridisation assay with spots representing the various components of the
subtractor DNA and the chromosome 21 source DNA. After several cycles, a strong
increase in the intensity of the chromosome 21 spot demonstrated the enrichment
of chromosome 21-specific sequences in the flow-through fraction.
However, a major drawback of this approach resulted from the fact that the
columns retained the large majority but less than 100% of the biotinylated
subtractor DNA. Because the subtractor was used in large excess, the fraction
of biotinylated subtractor sequences recovered in the flow-through fraction
amounted to a total similar to or even higher than the amount of the purified
source DNA. This fraction of subtractor DNA which leaked into the flow-through
fraction made the total DNA recovered in the flow-through fraction at this
stage of the development useless as a probe for immediate molecular cytogenetic
applications, in spite of the experimentally demonstrated enrichment of
chromosome 21 unique sequences. As a result of this leakage, repetitive
sequences, which were depleted from the 21 library probe, were replaced in the
flow-through fraction to a significant extent by repetitive sequences from the
subtractor DNA. Accordingly, this attempt to prepare a 21-specific chromosome
paint probe which could be used without Cot-1 DNA in the hybridisation mixture
failed, i.e. FISH experiments of the resulting probes to metaphase spreads did
not yield a specific painting of chromosome 21, but still visualised the entire
chromosome complement. In the case of positive selection (i.e. subsequent
elution of a fraction bound to the column) the situation is even worse, since
all subtractor DNA bound to the column will be eluted together with the
subtractor-source DNA hybrids, using this approach.
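
The scale of the leakage problem follows from simple arithmetic, sketched below. Only the 50-fold excess is taken from the text; the retention values and the `source_recovery` parameter are hypothetical figures chosen for illustration, since the column retention is stated only as "less than 100%".

```python
def leaked_subtractor_ratio(excess_fold: float, retention: float,
                            source_recovery: float = 1.0) -> float:
    """Ratio of leaked subtractor DNA to recovered source DNA in the flow-through.

    excess_fold      subtractor mass per unit of source mass loaded on the column
    retention        fraction of biotinylated subtractor held by the column (0..1)
    source_recovery  fraction of source DNA ending up in the flow-through (0..1],
                     a hypothetical parameter; 1.0 is the most favourable case
    """
    if not 0.0 <= retention <= 1.0:
        raise ValueError("retention must lie between 0 and 1")
    if not 0.0 < source_recovery <= 1.0:
        raise ValueError("source_recovery must lie in (0, 1]")
    leaked = excess_fold * (1.0 - retention)
    return leaked / source_recovery


# Hypothetical retention values for a 50-fold subtractor excess.
for retention in (0.90, 0.98, 0.999):
    ratio = leaked_subtractor_ratio(50.0, retention)
    print(f"retention {retention:.3f}: subtractor/source = {ratio:.2f}")
```

Under these assumptions, even a column that retains 98% of the subtractor leaves about as much subtractor as source DNA in the flow-through, and any loss of source DNA (a `source_recovery` below 1) makes the ratio worse.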



Two recent publications have used two modifications to the previously used
subtractive hybridisation protocol to positively select chromosome-specific
cDNAs (Chen-Liu et al., 1995; Rouquier et al., 1995). Firstly, source cDNA
libraries were made universally amplifiable by the addition of linker-adapter
ends. Using single chromosome libraries as subtractor DNA, chromosome-specific
cDNAs were PCR-amplified after positive selection. Secondly,
streptavidin-coated magnetic beads were used for affinity chromatography, so
that reaction volumes were kept to a minimum. In addition, single-stranded
source molecules could be released from immobilised subtractor molecules by
alkaline denaturation after stringent washes, thus further improving the purity
of the released source DNA. When two consecutive rounds of positive selection
were employed, this led to a selection of chromosome-specific cDNA. However,
the methods described in these two publications used Cot-1 DNA to suppress
any repeat-repeat hybridisation within or between subtractor and source DNA
(Chen-Liu et al., 1995; Rouquier et al., 1995). This makes the process of
positive selection more complicated and makes negative selection almost
impossible (the unlabelled Cot-1 DNA would pass through into and heavily
contaminate the supernatant to be amplified for negative selection).

Thus, the technical problem underlying the present invention is to provide a 
novel and generally useful strategy to fractionate and purify DNA from entire 
genomes, as well as from chromosome-specific or chromosome segment-specific 
DNA probes. This approach can be used to develop probe sets for molecular 
cytogenetics with novel and different characteristics, such as probe sets com- 
prising coding sequences, expressed sequences, sequences conserved between 
two distantly-related species, etc. Such probe sets will be generated in a way 
that they can be reamplified and used without any additional "blocking" agents 
such as Cot-1 DNA. These novel probe sets will become highly useful in
diagnostic and research settings.

The above technical problem is solved by providing the embodiments
characterized in the claims.

In particular, there is provided a method of purifying specific DNA molecules,
comprising the steps of

(a) mixing a set of DNA molecules as source DNA containing said specific
    DNA molecules as a subset, with a set of molecules as subtractor having
    substantially
    (i) an affinity for the subset, or
    (ii) no affinity for the subset,

(b) performing a binding reaction between source DNA and subtractor in
    solution,

(c) separating the subtractor which is present unbound or bound to source
    DNA, by binding the subtractor to a matrix material containing
    compounds having an affinity to the subtractor, from the reaction
    mixture, and

(d) recovering said subset which is either
    (i) bound to the subtractor, or
    (ii) not bound to the subtractor.



In the case of the feature of step (a) (ii), the subtractor having
substantially no affinity for the subset exhibits an affinity for substantially
all other DNA molecules contained in the source DNA.

The recovered subset obtained in step (d) (i) derives from a "positive
selection" using preferably an excess of source DNA. The recovered subset
obtained in step (d) (ii) derives from a "negative selection" using preferably
an excess of subtractor.
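
The logic of steps (a) to (d) can be illustrated with a deliberately simplified model. In the sketch below, each source fragment is assigned to a sequence family, and a fragment is treated as bound whenever the subtractor contains that family. This all-or-nothing binding rule, the `Fragment` type and the example families are assumptions made for illustration; real hybridisation is governed by kinetics and stringency and is never this clean.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Fragment:
    name: str
    family: str  # sequence family; same family is treated as able to bind


def select(source, subtractor_families, mode):
    """Toy model of steps (a) to (d).

    Fragments whose family occurs in the subtractor are captured on the
    matrix together with the subtractor (bound fraction); all others stay
    in the supernatant (unbound fraction).
    """
    bound = [f for f in source if f.family in subtractor_families]
    unbound = [f for f in source if f.family not in subtractor_families]
    if mode == "positive":   # step (d)(i): recover what is bound
        return bound
    if mode == "negative":   # step (d)(ii): recover what is not bound
        return unbound
    raise ValueError("mode must be 'positive' or 'negative'")


# Hypothetical source DNA and a repeat-containing subtractor.
source = [
    Fragment("s1", "Alu"),
    Fragment("s2", "unique-A"),
    Fragment("s3", "LINE"),
    Fragment("s4", "unique-B"),
]
repeats = {"Alu", "LINE"}

print([f.name for f in select(source, repeats, "negative")])
print([f.name for f in select(source, repeats, "positive")])
```

The same function serves both selection types, which mirrors the symmetry of step (d): only the fraction that is recovered differs.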



In a preferred embodiment, the subtractor is a set of DNA molecules and the
binding reaction in step (b) is a hybridisation. In this aspect, the
hybridisation may be performed by denaturing source DNA and subtractor mixed
together in a buffer containing 0.075 M to 1.5 M NaCl, at 90 to 100°C for 1 to
10 minutes and reannealing at 60 to 70°C for 5 to 48 hours. Alternatively, the
hybridisation may be performed by denaturing source DNA and subtractor mixed
together in a buffer containing 40 to 70% v/v formamide, 0.03 M to 0.75 M NaCl,
at 60 to 80°C for 1 to 10 minutes and reannealing at 30 to 50°C for 5 to 48
hours.
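
The two sets of hybridisation conditions lend themselves to a simple range check. The sketch below transcribes the stated ranges into two lookup tables and reports which parameters of a planned protocol fall outside them. The parameter names and the example protocol are hypothetical; only the numeric ranges come from the text.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Range:
    low: float
    high: float

    def contains(self, value: float) -> bool:
        return self.low <= value <= self.high


# Ranges transcribed from the two hybridisation variants described above.
AQUEOUS = {
    "nacl_molar": Range(0.075, 1.5),
    "denature_celsius": Range(90, 100),
    "denature_minutes": Range(1, 10),
    "reanneal_celsius": Range(60, 70),
    "reanneal_hours": Range(5, 48),
}
FORMAMIDE = {
    "formamide_percent_vv": Range(40, 70),
    "nacl_molar": Range(0.03, 0.75),
    "denature_celsius": Range(60, 80),
    "denature_minutes": Range(1, 10),
    "reanneal_celsius": Range(30, 50),
    "reanneal_hours": Range(5, 48),
}


def out_of_range(protocol: dict, regime: dict) -> list:
    """Return names of parameters that are missing or outside the stated range."""
    return [name for name, rng in regime.items()
            if name not in protocol or not rng.contains(protocol[name])]


# Hypothetical protocol, checked against the formamide-free variant.
planned = {
    "nacl_molar": 0.3,
    "denature_celsius": 95,
    "denature_minutes": 5,
    "reanneal_celsius": 55,
    "reanneal_hours": 16,
}
print(out_of_range(planned, AQUEOUS))
```

In this example the reannealing temperature of 55°C lies below the 60 to 70°C window of the formamide-free variant, so it is the only parameter reported.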

In a further preferred embodiment of the present invention, the source DNA is a
DNA probe used in FISH. The source DNA may comprise DNA of a defined
function or a repeat-free chromosome-specific library. Examples are cDNA, CpG
islands, scaffold-attached DNA and DNA of a defined replication time.

The subtractor may be a set of labelled molecules such as DNA molecules,
wherein the percentage of labelling is > 90%, more preferred > 95%, and most
preferred ≥ 97%, such as 99.5%. The molecules are labelled e.g. with biotin or
digoxigenin, wherein the label or labelling method are not specifically
restricted and any label or labelling method known in the art can be used
within the present invention. In a preferred embodiment of the present
invention, the subtractor is Cot-1 DNA.
Cot DNA means the double-stranded DNA formed after a mixture of DNA molecules,
at a known concentration of DNA and salt, has been denatured and reannealed
until a certain value of the product of molar concentration (C0) and time (t)
has been reached. Cot-1 DNA represents the double-stranded DNA formed after DNA
has been denatured and reannealed until the product of molar concentration and
time is 1, and represents sequences repeated more than 10,000 times per haploid
human genome.
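
The Cot value can be illustrated with a small calculation. The sketch assumes the conventional units (moles of nucleotide per litre multiplied by seconds), which the text does not state explicitly, and an average mass of about 330 g/mol per nucleotide residue; it ignores any correction for salt concentration. The function names are illustrative.

```python
# Assumed average mass of one nucleotide residue in DNA (g/mol).
AVG_NUCLEOTIDE_G_PER_MOL = 330.0


def nucleotide_molarity(dna_ug_per_ml: float) -> float:
    """Convert a DNA mass concentration into mol nucleotides per litre (C0)."""
    grams_per_litre = dna_ug_per_ml * 1e-3  # 1 ug/ml equals 1 mg/l
    return grams_per_litre / AVG_NUCLEOTIDE_G_PER_MOL


def seconds_to_reach_cot(target_cot: float, dna_ug_per_ml: float) -> float:
    """Reannealing time t (s) such that C0 * t equals the target Cot value."""
    c0 = nucleotide_molarity(dna_ug_per_ml)
    if c0 <= 0:
        raise ValueError("DNA concentration must be positive")
    return target_cot / c0


# Hypothetical DNA concentrations; the target is Cot = 1 as for Cot-1 DNA.
for conc in (100.0, 500.0, 1000.0):
    minutes = seconds_to_reach_cot(1.0, conc) / 60.0
    print(f"{conc:6.0f} ug/ml: about {minutes:.1f} min to reach Cot 1")
```

Because Cot is a product, halving the DNA concentration doubles the reannealing time needed to reach the same value.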

The subtractor may comprise a repeat-free chromosome-specific library or DNA
of a defined function. Examples are cDNA, CpG islands, and scaffold-attached
DNA.

Preferred embodiments of the present invention with respect to source DNA and 
subtractor are as follows: 

- the source DNA comprises repeat-free DNA from one species and the
  subtractor comprises repeat-free DNA from another species;
- the source DNA comprises repeat-free DNA from one species or DNA
  sequences shared between two different species, and the subtractor is a
  repeat-free chromosome-specific library from another species;
- the source DNA is a repeat-free chromosome-specific library from one
  species, and the subtractor comprises repeat-free DNA from another
  species or DNA sequences shared between two different species;
- the source DNA comprises repeat-free DNA from one particular tissue
  and/or developmental stage, and the subtractor comprises repeat-free
  DNA of a defined function from another particular tissue and/or
  developmental stage of the same organism.

The matrix material used in step (c) is not specifically restricted, and any
matrix material known in the art can be used within the present invention. The
compounds contained in the matrix material are preferably immobilized to the
matrix material. In a preferred example, the matrix material contains
streptavidin, such as streptavidin-conjugated magnetic beads.

In another embodiment of the present invention, the source DNA contains
thymidine analogues, the subtractor contains anti-thymidine analogue
antibodies, and the matrix material contains compounds capable of binding to
the anti-thymidine analogue antibodies. Alternatively, the source DNA contains
bromodeoxyuridine (BrdU), the subtractor contains anti-BrdU antibodies, and the
matrix material contains compounds capable of binding to the anti-BrdU
antibodies.

Prior to step (a) of the method according to the present invention, the source
DNA may be subjected to PCR such as DOP-PCR. In a preferred embodiment,
prior to step (a), the source DNA is subjected to a further round of PCR with a
second primer whose 5'-portion comprises a sequence of nucleotides not present
in the DOP primer, and whose 3'-portion consists of a number of the non-random
nucleotides from the 5'-end of the DOP primer. Preferably, the DOP-PCR primer
("first primer") has the following sequence:

5'-CCG ACT CGA GNN NNN NAT GTG G-3'

wherein N may be any nucleotide.

Further, the second primer may have one of the following sequences:

5'-CTA CTA CTA CTA CCG ACT CGA G-3', or
5'-TGA TCA CGC TAC CCG ACT CGA G-3'.
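
The relationship between the first (DOP) primer and the two second primers can be checked mechanically. The sketch below splits each second primer into its novel 5' tail and the 3' portion it shares with the non-random 5' nucleotides of the DOP primer. The helper names are illustrative and not part of the described method.

```python
DOP_PRIMER = "CCGACTCGAGNNNNNNATGTGG"
SECOND_PRIMERS = ["CTACTACTACTACCGACTCGAG", "TGATCACGCTACCCGACTCGAG"]


def specific_five_prime(dop_primer: str) -> str:
    """Non-random nucleotides upstream of the first degenerate (N) position."""
    return dop_primer[:dop_primer.index("N")]


def split_second_primer(second: str, dop_primer: str):
    """Split a second primer into (novel 5' tail, 3' portion shared with DOP)."""
    core = specific_five_prime(dop_primer)
    if not second.endswith(core):
        raise ValueError("3' portion does not match the DOP primer's 5' end")
    return second[:-len(core)], core


for primer in SECOND_PRIMERS:
    tail, core = split_second_primer(primer, DOP_PRIMER)
    # The tail should be absent from the DOP primer and free of N positions.
    print(tail, core, tail not in DOP_PRIMER, "N" not in primer)
```

Both second primers end in the ten specific nucleotides CCGACTCGAG and contain no degenerate positions, so priming in this further round depends only on defined sequence.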

After step (d) of the method according to the present invention, the recovered
subset may be subjected to PCR. Further, steps (a) to (d) may be repeated at
least once using each recovered subset obtained in step (d) as source DNA in
step (a).

The subset obtained according to the method of the present invention may be
used as a probe for DNA-DNA hybridisations such as Southern blotting or FISH.



A further subject of the present invention relates to a DNA probe containing less 
than 2% Cot-1 DNA. 
The source DNA is preferably selected from probes cloned in vectors, complex
probe sets created by PCR amplification and/or microdissection or flow-sorting,
repeat-free chromosome-specific libraries, repeat-free genomic DNA purified
from cells with thymidine analogues incorporated into R- or G-bands, early or
late replicating genomic DNA and repeat-free genomic DNA. The source DNA
originates from any living material such as plants, fungi, bacteria, animals
and humans, and specific tissues or cells thereof. Examples are: humans, mice,
sheep, cows, horses, pigs, goats, rabbits, ostriches, chickens, pigeons, maize,
wheat, rice, barley, oats, rye, sorghum, millet, and yeasts such as
Saccharomyces cerevisiae and Schizosaccharomyces pombe.



The Figures show: 



Figure 1 is a general set-up of one preferred embodiment of the present
invention. a, General schematic diagram, and b, as applied to the removal of
repetitive DNA from FISH probe sets. PCR-amplifiable probe sets (acting as
"source" DNA) are hybridized in solution with biotinylated Cot-1 DNA (acting as
"subtractor" DNA). Cot-1:Cot-1 and Cot-1:probe set hybrids are removed using
streptavidin-magnetic bead affinity chromatography, leaving the repeat-depleted
sub-fraction of the probe set (negative selection) to be purified and
PCR-amplified.

Figure 2 is a fluorescence in situ hybridization of probe sets (A, B, 15q
library; C, D, genomic DNA) without Cot-1 DNA, after two rounds of repeat
depletion. Images were taken directly through the microscope with a
conventional camera; no image processing was employed. A, C: DAPI
counterstaining; B, D: biotin-labelled, avidin-FITC-detected depleted probe
sets. In B, two metaphase chromosomes (center, indicated with arrowheads in A)
and two interphase chromosomes 15q (left) have been painted. Background
hybridization to other, non-target chromosomes is relatively low compared to
the target chromosomes, indicating that sufficient repeats have been removed
from the probe set as are necessary for Cot-1-free FISH. In D, the centromeric
satellite DNA of chromosome 1 (arrowheads), the heterochromatin of the Y
chromosome, plus some non-hybridising centromeres (arrows) are indicated as
examples of regions of repetitive DNA that have been effectively removed from
the genomic DNA probe set.

According to the present invention, a preferred embodiment is that > 95% of
subtractor molecules are labelled, such as biotin-labelled; any unlabelled
molecules will not be captured by the matrix material, such as
streptavidin-magnetic beads. In the case of positive selection, this would
result in the loss of a subset of subtractor-homologous source DNA sequences.
In the case of negative selection, this would lead to the contamination
(dilution) of the source DNA with subtractor DNA and the possibility that a
subset of unwanted, subtractor-homologous source DNA sequences would not be
captured by affinity binding and would therefore also be amplified by the
subsequent PCR amplification, thus making selection incomplete.

In one preferred embodiment of the present invention, stringent PCR
amplification is central to the claimed method. If PCR reactions are not
sufficiently stringent, contaminating subtractor DNA may be co-amplified with
source DNA after recovering in step (d) of the method according to the present
invention. Sequence-specific PCR and linker-adapter (universal) PCR are
stringent enough in this respect. However, although the universal DOP-PCR
technique has the advantage over linker-adapter PCR that it requires no
pre-ligation of linker molecules, it is less stringent: even when the so-called
"stringent" (high annealing temperature) DOP-PCR cycles are used to reamplify
DOP-PCR probe libraries, amplification is not completely restricted to DNA
molecules which carry DOP primer sequences at their ends. Thus, a certain
likelihood exists that contaminating subtractor sequences could still be
amplified. DOP-PCR is the most widely used technique to generate complex FISH
probes. From the starting material of flow-sorted or microdissected genomes,
genome subsets, chromosomes and chromosome subsets down to the level of single
chromosome bands, DOP-PCR is used as a universal PCR amplification technique to
generate amplifiable probe sets. It is also the ideal protocol to convert
probes that are not already PCR-amplified into PCR-amplified probe sets, thus
eliminating the need for cloning in vectors.
In one preferred embodiment of the present invention, a new DOP-PCR primer is
provided to exclude the degenerate sequences from the priming reaction of PCR
amplification. This was done by taking the first ten, specific nucleotides from
the "6MW" primer and adding twelve more nucleotides to the 5' end. The new
primer (5'-CTA CTA CTA CTA CCG ACT CGA G-3') is also called "CTA4 DOP".
This primer was used in a modified DOP-PCR reaction to effectively add the
CTA4 sequence onto all source DNA molecules before the start of step (a) of the
method according to the present invention. Tests showed that, using a slightly
modified, stringent set of CTA4 DOP-PCR cycles, no amplification of
contaminating sequences occurred.

While previous studies on PCR-assisted affinity capture implied and performed 
only positive selection of source DNA (Chen-Liu et al., 1995 and Rouquier et al., 
1995) to a certain extent, the method of the present invention can also be used
successfully for negative selection. For example, after separation of e.g. biotin- 
labelled, subtractor-subtractor and subtractor-source molecules, all subtractor- 
nonhomologous source sequences should be left in the supernatant. One extra 
step of e.g. streptavidin-magnetic particle affinity binding with this negatively- 
selected supernatant fraction is performed to ensure the capture of any remai- 
ning subtractor sequences. This effectively ensures that >99.5% of all sub- 
tractor molecules have been removed during negative selection. 
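
The effect of the extra capture step can be estimated with a simple model. The sketch assumes that every round of affinity binding captures the same fraction of the labelled subtractor still present and that rounds are independent; both are simplifying assumptions, and the per-round efficiencies used are hypothetical.

```python
def removed_fraction(per_round_capture: float, rounds: int) -> float:
    """Fraction of subtractor removed after a number of capture rounds."""
    if not 0.0 <= per_round_capture <= 1.0 or rounds < 1:
        raise ValueError("capture must lie in 0..1 and rounds must be >= 1")
    return 1.0 - (1.0 - per_round_capture) ** rounds


def min_capture_for(target_removed: float, rounds: int) -> float:
    """Smallest per-round capture efficiency that reaches the target removal."""
    if not 0.0 <= target_removed < 1.0 or rounds < 1:
        raise ValueError("target must lie in 0..1 (exclusive of 1)")
    return 1.0 - (1.0 - target_removed) ** (1.0 / rounds)


# Hypothetical per-round efficiencies compared for one and two rounds.
for capture in (0.90, 0.95, 0.99):
    one = removed_fraction(capture, 1)
    two = removed_fraction(capture, 2)
    print(f"capture {capture:.2f}: one round {one:.4f}, two rounds {two:.4f}")

print(f"needed per round for 99.5% in two rounds: {min_capture_for(0.995, 2):.3f}")
```

Under this model, a per-round efficiency of roughly 93% would already suffice to exceed 99.5% removal after two rounds, whereas a single round would need to reach 99.5% on its own.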

Using the method according to the present invention, repetitive DNA can be
successfully removed from complex probe pools (see Figure 2). If all subtractor
and source DNA sets (apart from those involved in the actual process of repeat
removal) are pre-treated in this way, prehybridisation of either DNA with Cot-1
DNA can be entirely avoided and negative selection would be made much easier.

The general set-up of one preferred embodiment of the present invention is
illustrated in Figure 1a. The subtractor DNA comprises the sequences for which
positive or negative selection is required. The source DNA comprises the probe
set from which to select either subtractor-homologous sequences (positive
selection) or subtractor-nonhomologous sequences (negative selection). The
source DNA probe or probe set is first amplified using PCR methods, the most
widely used of which will be the above defined DOP-PCR technique using the
CTA4 DOP primer. The subtractor DNA is labelled (e.g. with biotin using Biotin
Chem-Link, Boehringer Mannheim). Source DNA is hybridised to subtractor DNA,
then mixed with streptavidin-magnetic particles. In positive selection, source
DNA molecules containing subtractor-homologous sequences are purified using
high stringency washes followed by alkali denaturation and universal PCR. In
negative selection, source DNA molecules containing no subtractor-homologous
sequences are purified using a further incubation with streptavidin-magnetic
particles followed by universal PCR. This selected DNA can be tested using
normal hybridisation techniques and subjected to further rounds of selection if
required. In this respect, it has been demonstrated that as many as five
consecutive reamplifications of a DOP or linker-adapter chromosome library do
not apparently decrease the complexity of the probe set as tested by FISH.


The present invention makes possible the development of novel probes with 
many different applications in diagnostic and research settings. In addition, 
reamplifiable probes allow both small and large scale probe applications. 






In the following, preferred embodiments of the present invention are summari- 
zed: 
(1) Preparation of repeat-free PCR-amplifiable DNA probes

Type of selection:        Negative

Source DNA:               PCR-amplified DNA probes, e.g. probes cloned in
                          vectors such as phages, cosmids, YACs, BACs,
                          PACs; complex probe sets created by PCR
                          amplification and/or microdissection or
                          flow-sorting, e.g. band, arm or whole
                          chromosome-specific probe sets.

Subtractor DNA:           Biotin-labelled Cot-1 (or higher Cot) DNA

Desired DNA:              PCR-amplified DNA from the supernatant fraction
                          after streptavidin-magnetic particle affinity
                          chromatography.

Expected characteristics  Repeat-free probe sets that require little or no
of new probe sets:        suppression before DNA-DNA hybridisation and
                          which achieve low levels of background
                          hybridisation combined with the desired
                          specificity.



Applications: Production of repeat-free, PCR-amplifiable probe sets for DNA-DNA 
hybridisations such as Southern blotting or in situ hybridisation of human and 
other species for diagnostic and research use. Major examples are PCR-amplified 
arm- or whole-chromosome probe sets for multicolour FISH assays, genomic 
DNA for comparative genomic hybridisation to chromosomes or matrices or in 
representational difference analysis (RDA; Lisitsyn et al., 1993) or genomic 
mismatch scanning (2-20 kb; Nelson et al., 1993).

(2) Preparation of chromosome-specific functionally-defined DNA probes



Type of selection:      Positive

Source DNA:             Repeat-free PCR-amplified chromosome-specific
                        library (see embodiment 1)

Subtractor DNA:         (Repeat-free and/or PCR-amplified)
                        functionally-defined subsets of DNA, e.g. cDNA
                        libraries, CpG island libraries,
                        scaffold-attached regions (SARs), DNA of defined
                        replication time.

Desired DNA:            PCR-amplified DNA from the bound fraction after
                        streptavidin-magnetic particle affinity
                        chromatography.

Expected characteristics
of new probe:           Chromosome-specific DNA of a defined function.



Note: Source and subtractor DNAs are interchangeable for this embodiment.



Applications: Detection of metaphase and interphase chromosome subregions
with a defined function, e.g. to detect regions containing a high density of
(active) genes. Screening of cosmid, BAC, PAC and YAC libraries for expressed
sequences.



(3) Preparation of genomic probes enriched in evolutionarily conserved
sequences



Type of selection:      Positive

Source DNA:             Repeat-free, PCR-amplified genomic DNA from one
                        species (see embodiment 1)

Subtractor DNA:         Repeat-free genomic DNA from another species

Desired DNA:            PCR-amplified DNA from the bound fraction after
                        streptavidin-magnetic particle affinity
                        chromatography.

Expected characteristics
of new probe:           Sequences conserved between the two genomes;
                        the extent of conservation depends on the
                        stringency conditions used for hybridisation in
                        suspension.

Applications: FISH of the probe obtained from one species to metaphase spreads
from the other species provides a tool for the comparative physical mapping of
highly conserved sequences. Such an approach will be particularly interesting
for distantly-related species.



(4) Preparation of chromosome-specific probes enriched in evolutionarily
conserved sequences



Type of selection:      Positive

Source DNA:             Repeat-free, PCR-amplified DNA from one species
                        or DNA sequences shared between two different
                        species (see embodiment 3)

Subtractor DNA:         Repeat-free chromosome-specific library from
                        another species

Desired DNA:            PCR-amplified DNA from bound fraction after
                        streptavidin-magnetic particle affinity
                        chromatography

Expected characteristics
of new probe:           Chromosome-specific DNA from the first species
                        highly enriched for evolutionarily conserved
                        sequences






Note: Source and subtractor DNAs are interchangeable for this embodiment. 



Applications: These probes should become highly useful for comparative
chromosome mapping. For example, FISH of this probe to metaphase chromosomes
of the second species should light up the homologous segments. Fractionation
of all chromosome-specific paint probes of a given species, e.g. human, in
this way should provide tools for comparative mapping even beyond the
mammalian order. In combination with multicolour FISH the rapid establishment
of comparative chromosome maps even for very distantly-related species will
become feasible.

(5) Selection of DNA molecules exclusive to one tissue or developmental
stage



Type of selection:      [illegible]

Source DNA:             Repeat-free, PCR-amplified, functionally-defined
                        subsets of DNA (see embodiment 2) from one or
                        more tissues or one or more developmental stages
                        of an organism.

Subtractor DNA:         Repeat-free, PCR-amplified, functionally-defined
                        subsets of DNA (see embodiment 2) from different
                        tissues or developmental stages of the same
                        organism.

Desired DNA:            PCR-amplified DNA from bound fraction after
                        streptavidin-magnetic particle affinity
                        chromatography.

Expected characteristics
of new probe:           Tissue-specific or developmental stage-specific
                        DNA.



Note: Source and subtractor DNAs are interchangeable for this embodiment.

Applications: Sets of DNA molecules can be used to isolate genes active in 
particular tissue(s) or developmental stage(s). 


The following examples illustrate further the present invention.

Biotinylation of Cot-1 DNA

Cot-1 DNA (Gibco BRL or made in our lab) was labelled with biotin using a
Biotin Chem-Link kit (Boehringer Mannheim). Manufacturer's instructions were
followed except that the reaction was scaled up and 25µg (25µl) DNA was
labelled with 25µl Biotin Chem-Link and the volume made up to 75µl with
distilled water. After labelling (30 min at 85°C), 25µl "Stop Buffer"
(provided with the Biotin Chem-Link kit) was added and the products cleaned
using standard Sephadex G50 columns (Sambrook et al. 1989) or BioRad Micro
Bio-Spin 30 Tris columns.

Preparation of source DNA

All PCR reactions were performed in the following buffer (modified slightly
from Telenius et al. 1992): 1x Taq DNA polymerase buffer (50mM KCl, 10mM
Tris.HCl pH 8.3, Perkin Elmer), 2mM MgCl2, 0.2mM of each dNTP, 1.7µM primer
6MW (5'-CCG ACT CGA GNN NNN NAT GTG G-3'), 100ng template DNA and 5 units of
Taq DNA polymerase (Pharmacia), in a final volume of 100µl. Screw-capped
tubes were used for amplification (using a Biomed Thermocycler 60); no oil
overlay was used. The first stage of probe preparation was DOP-PCR (Telenius
et al. 1992). PCR conditions were as follows: 96°C for 5 min, followed by the
addition of Taq polymerase; 5 cycles of 96°C for 1 min, 30°C for 1.5 min,
72°C for 3 min with a 3 min time ramp from 30°C to 72°C; 35 cycles of 96°C
for 1 min, 62°C for 1.5 min, 72°C for 3 min, with the addition of 1 sec. per
cycle to the extension time. PCR products were purified using a QIAquick PCR
purification kit (Qiagen) according to manufacturer's instructions. The
second stage involved amplification of 100ng of products from the first PCR
reaction with a newly-designed primer "CTA4DOP" (5'-CTA CTA CTA CTA CCG ACT
CGA G-3'). PCR conditions were as follows: 96°C for 5 min, followed by the
addition of Taq polymerase; 4 cycles of 96°C for 1 min, 48°C for 1.5 min,
72°C for 3 min; 17 cycles of 96°C for 1 min, 55°C for 1.5 min, 72°C for 3
min, with the addition of 1 sec. per cycle to the extension time. DNA was
purified as above.
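
The relationship between the two primers can be read off their sequences: the
3' part of the second-stage primer is the run of fixed nucleotides at the 5'
end of the DOP primer 6MW, preceded by a new 5' tail. The following Python
sketch, given for illustration only, reconstructs the second-stage primer from
6MW under that reading. The function names are hypothetical, and the check
that the tail is absent from the DOP primer is a simple literal substring
test.

```python
DOP_PRIMER_6MW = "CCGACTCGAGNNNNNNATGTGG"  # from the text; N = any nucleotide


def fixed_five_prime(dop_primer):
    """Return the non-random nucleotides that precede the first N."""
    return dop_primer[: dop_primer.index("N")]


def second_stage_primer(tail, dop_primer):
    """Prefix the fixed 5' part of the DOP primer with a new 5' tail."""
    if tail in dop_primer:
        raise ValueError("tail must not already occur in the DOP primer")
    return tail + fixed_five_prime(dop_primer)


def triplets(seq):
    """Format a sequence in groups of three, as it is printed in the text."""
    return " ".join(seq[i:i + 3] for i in range(0, len(seq), 3))


cta4dop = second_stage_primer("CTA" * 4, DOP_PRIMER_6MW)
print("5'-" + triplets(cta4dop) + "-3'")
```

Because the tail is not part of 6MW, the second-stage primer can anneal in
full only to molecules that already carry the first-stage primer sequence at
their ends.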

Hybridisation of biotinylated Cot-1 and source DNA.

200ng source DNA was added to a 50-fold excess (10µg) of biotin-labelled
Cot-1 DNA. The DNAs were precipitated together by addition of 1/10 volume 3M
sodium acetate followed by a standard ethanol precipitation. Dried mixtures
of biotinylated Cot-1 DNA and source DNA were redissolved in 100µl
6xSSC/0.1% SDS, denatured by boiling for 10 min and hybridised at 65°C
overnight (Rouquier et al., 1995).
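
The quantities in this step can be checked with a few lines of arithmetic.
The Python sketch below is illustrative only. It assumes standard SSC (1x SSC
containing 0.15 M NaCl), takes boiling as roughly 100°C, and compares the
conditions with the salt and temperature ranges recited in claim 3; the
duration of "overnight" is not stated and is therefore not checked. All
function names are hypothetical.

```python
SSC_1X_NACL_M = 0.15  # assumption: standard SSC, 1x = 0.15 M NaCl


def subtractor_mass_ug(source_ng, fold_excess):
    """Mass of subtractor DNA (in micrograms) for a given fold excess."""
    return source_ng * fold_excess / 1000.0


def nacl_molar(ssc_strength):
    """NaCl molarity contributed by SSC at the given strength (e.g. 6 for 6x)."""
    return ssc_strength * SSC_1X_NACL_M


def within(value, low, high):
    """True if value lies in the closed interval [low, high]."""
    return low <= value <= high


mass_ug = subtractor_mass_ug(200, 50)  # 200 ng source, 50-fold excess
salt_m = nacl_molar(6)                 # 6xSSC

conditions_ok = (
    within(salt_m, 0.075, 1.5)   # NaCl range of claim 3
    and within(100, 90, 100)     # denaturation temperature, boiling taken as 100
    and within(10, 1, 10)        # denaturation time in minutes
    and within(65, 60, 70)       # reannealing temperature
)
print(mass_ug, round(salt_m, 3), conditions_ok)
```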
Selection of unbound source molecules using negative selection with
streptavidin affinity chromatography.



Prior to streptavidin-magnetic particle affinity chromatography, the two
hybridised DNAs were cooled to room temperature. 2mg (200µl) streptavidin
magnetic particles (Boehringer Mannheim) were prepared according to
manufacturer's instructions and resuspended in 125µl of 10mM Tris.HCl, pH
8.0/1mM EDTA pH 8.0/2M NaCl (2x binding and washing buffer). 100µl
streptavidin magnetic particles were added to 100µl hybridised DNA mixture
and incubated at room temperature for 30 min with axial rotation. Tubes were
then applied to a magnetic particle separator (Boehringer Mannheim) for 3 min
and the supernatant gently removed. This supernatant was added directly to
the remaining, unused magnetic particles with buffer freshly removed, and
incubated with axial rotation as above. The second supernatant (200µl) was
removed and DNA purified using a QIAex II kit (Qiagen) according to
manufacturer's instructions, and resuspended in 25µl TE (10mM Tris.HCl, pH
8.0/1mM EDTA pH 8.0).
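
As a rough check on the quantities used in the two capture steps, the
following Python sketch may be helpful. It is illustrative only and rests on
two assumptions that the text does not state outright: that the particles are
evenly suspended in the 125µl, and that the "remaining, unused" particles are
the 25µl of suspension not used in the first incubation. The function names
are hypothetical.

```python
def split_beads(total_mg, suspension_ul, first_aliquot_ul):
    """Return (mg in first aliquot, mg remaining) for an even suspension."""
    mg_per_ul = total_mg / suspension_ul
    first_mg = mg_per_ul * first_aliquot_ul
    return first_mg, total_mg - first_mg


def dilute(stock_molar, stock_ul, final_ul):
    """Concentration contributed by a stock after dilution to final_ul."""
    return stock_molar * stock_ul / final_ul


# 2 mg particles in 125 ul of 2x buffer; 100 ul used in the first incubation.
first_mg, second_mg = split_beads(2.0, 125.0, 100.0)

# NaCl contributed by the 2x buffer (2 M) after mixing 100 ul with 100 ul DNA.
# The salt already present in the hybridisation mixture is not included.
buffer_nacl_m = dilute(2.0, 100.0, 200.0)

print(round(first_mg, 2), round(second_mg, 2), buffer_nacl_m)
```

On these assumptions about four fifths of the particles are used in the first
incubation, and mixing equal volumes brings the binding and washing buffer to
1x.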

Reamplification of selected source DNA. 

5µl (approximately 30ng) of purified, selected source DNA was subjected to PCR
with the primer CTA4DOP as above, without the four low annealing temperature
cycles. DNA was purified as for previous PCR amplifications. At this stage,
one round of selection was complete. If a further round of selection was
required, this DNA would be treated as source DNA and subjected once more to
hybridisation with Cot-1, affinity chromatography and PCR.
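
The three cycling programmes described (DOP-PCR, second-stage PCR with
CTA4DOP, and reamplification) can be compared by their programmed hold times.
The Python sketch below is an illustration only. `Block`, `block_seconds` and
`program_seconds` are hypothetical names; the 1 sec. increment is assumed to
apply from the second cycle of a block onwards; and instrument ramp times
other than the stated 3 min ramp are ignored.

```python
from typing import NamedTuple


class Block(NamedTuple):
    cycles: int
    step_seconds: tuple  # hold times (and any programmed ramp) in one cycle
    increment: int = 0   # seconds added to the extension per successive cycle


def block_seconds(block):
    """Programmed time for one block, including the growing extension step."""
    base = block.cycles * sum(block.step_seconds)
    extra = block.increment * block.cycles * (block.cycles - 1) // 2
    return base + extra


def program_seconds(initial_seconds, blocks):
    """Initial denaturation plus all cycling blocks."""
    return initial_seconds + sum(block_seconds(b) for b in blocks)


INITIAL = 5 * 60  # 96 degrees C for 5 min

# 5 low-stringency cycles (with the 3 min ramp), then 35 cycles at 62 degrees C.
DOP_PCR = [Block(5, (60, 90, 180, 180)), Block(35, (60, 90, 180), 1)]
# 4 cycles at 48 degrees C, then 17 cycles at 55 degrees C.
SECOND_STAGE = [Block(4, (60, 90, 180)), Block(17, (60, 90, 180), 1)]
# Reamplification: as the second stage, without the four low-temperature cycles.
REAMPLIFICATION = [Block(17, (60, 90, 180), 1)]

for name, blocks in [("DOP-PCR", DOP_PCR),
                     ("second stage", SECOND_STAGE),
                     ("reamplification", REAMPLIFICATION)]:
    total = program_seconds(INITIAL, blocks)
    print(f"{name}: {total} s ({total / 3600:.2f} h)")
```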

Fluorescence in situ hybridisation. 

Probes were labelled with biotin-16-dUTP (Boehringer Mannheim) using standard
nick translation procedures. Hybridisation to metaphase chromosomes was
carried out as follows: slides were denatured for 2 min at 72°C in 70%
formamide/0.6xSSC/pH7.0; thereafter slides were put through an ice cold
ethanol series (70%/90%/100%) and air-dried. 250ng of probe together with
50µg sonicated salmon testis DNA (Sigma) were ethanol precipitated, dried,
and redissolved in 10µl hybridisation solution (50% formamide/1xSSC/10%
dextran sulphate). Slides were denatured in 70% formamide/2xSSC for 2 min at
72°C and put through another ethanol series; probes were denatured at 75°C
for 5 min and not reannealed. Probes were then added to slides, covered by an
18 x 18 mm coverslip, sealed with rubber cement and hybridised at 37°C
overnight. Slides were washed 3x5 min in 50% formamide/1xSSC and another 3x5
min in 0.1xSSC at 60°C. Thereafter, they were incubated for 30 min with 4%
BSA/2xSSC/0.1% Tween 20 at 37°C (a blocking step to reduce background).
Slides were then washed briefly in 4xSSC/0.1% Tween 20 then incubated with
1ng/µl avidin-FITC (Vector) in 100µl 4xSSC/0.1% Tween 20 for 30 min at 37°C.
Slides were then washed 3x2 min in 4xSSC/0.1% Tween 20 at 45°C, and
incubated in 2.5ng/µl biotinylated anti-avidin (Vector) in 100µl 4xSSC/0.1%
Tween 20 for 30 min at 37°C. Slides were washed and incubated once more with
avidin-FITC and washed again, as above. Chromosomes were counterstained in
20ng/ml DAPI in 4xSSC/0.1% Tween 20 for 5 min, rinsed in distilled water and
mounted in Vectashield antifade mounting medium (Vector).

Fluorescence microscopy. 


A Zeiss Axiophot microscope equipped with a 100W mercury lamp was used
with DAPI and FITC Chroma filters (AHF Tübingen, Germany). All images were
acquired with a Plan Neofluar 100x/1.3 oil immersion lens. Photomicrographs
were taken directly through the microscope with Kodak Ektachrome 400 film; no
image processing was employed.

The approach outlined above (using two rounds of negative selection) is
successfully used to remove repetitive DNA from probes (and probe sets) which
formerly were or were not PCR-amplified (Figure 1b). Of the already
DOP-PCR-amplified probe sets, both single arm painting probes (e.g. human
chromosome 15q, see Figure 2) and whole chromosome painting probes (e.g.
human chromosome 6, results not shown) have been repeat-depleted. Further,
genomic DNA, after first making it amplifiable with our stringent DOP-PCR
primers (Figure 2), has been repeat-depleted. The same was done with PAC and
YAC clones (results not shown). FISH of the selected probes without Cot-1 DNA
was used to demonstrate the effectiveness of the selection technique (Figure
2a, b). Hybridisation of the depleted chromosome 15q-specific probe library
showed a strong hybridisation to chromosome 15q only (Figure 2c, d).
Hybridisation of the treated genomic DNA showed a strong hybridisation to all
regions of the genome except for those known to exclusively contain
repetitive DNA, e.g. centromeric and Y-specific satellite DNA (Figure 2b).
This has demonstrated not only that a range of FISH probes can be made
PCR-amplifiable, but also that both interspersed (e.g. Alu repeats present on
15q) and clustered repeats (e.g. centromeric and heterochromatic satellite
DNA present in genomic DNA) can be removed from such probes.

WO 98/15649 PCT/EP97/04199

Claims



1. A method of purifying specific DNA molecules, comprising the steps of

(a) mixing a set of DNA molecules as source DNA containing said
specific DNA molecules as a subset, with a set of molecules as
subtractor having substantially

(i) an affinity for the subset, or

(ii) no affinity for the subset,

(b) performing a binding reaction between source DNA and subtractor
in solution,

(c) separating the subtractor which is present unbound or bound to
source DNA, by binding the subtractor to a matrix material
containing compounds having an affinity to the subtractor, from the
reaction mixture, and

(d) recovering said subset which is either

(i) bound to the subtractor, or

(ii) not bound to the subtractor.

2. The method according to claim 1, wherein the subtractor is a set of DNA
molecules and the binding reaction in step (b) is a hybridisation.

3. The method according to claim 2, wherein the hybridisation is performed
by denaturing source DNA and subtractor mixed together in a buffer
containing 0.075 M to 1.5 M NaCl, at 90 to 100°C for 1 to 10 minutes
and reannealing at 60 to 70°C for 5 to 48 hours.

4. The method according to claim 2, wherein the hybridisation is performed
by denaturing source DNA and subtractor mixed together in a buffer
containing 40 to 70% v/v formamide, 0.03 M to 0.75 M NaCl, at 60 to
80°C for 1 to 10 minutes and reannealing at 30 to 50°C for 5 to 48
hours.



5. The method according to any one of claims 1 to 4, wherein the source
DNA is a DNA probe used in FISH.

6. The method according to any one of claims 1 to 5, wherein the source
DNA comprises DNA of a defined function or a repeat-free
chromosome-specific library.

7. The method according to claim 6, wherein the source DNA is selected
from the group consisting of cDNA, CpG islands, scaffold-attached DNA
and DNA of a defined replication time.

8. The method according to any one of claims 1 to 6, wherein the
subtractor is a set of labelled molecules.

9. The method according to claim 8, wherein the molecules are labelled
with biotin or digoxigenin.

10. The method according to any one of claims 1 to 9, wherein the
subtractor is Cot-1 DNA.

11. The method according to any one of claims 1 to 10, wherein the
subtractor comprises a repeat-free chromosome-specific library or DNA of a
defined function.



12. The method according to claim 11, wherein the DNA of a defined
function is selected from the group consisting of cDNA, CpG islands,
and scaffold-attached DNA.

13. The method according to any one of claims 1 to 12, wherein the source
DNA comprises repeat-free DNA from one species and the subtractor
comprises repeat-free DNA from another species.

14. The method according to any one of claims 1 to 12, wherein the source
DNA comprises repeat-free DNA from one species or DNA sequences
shared between two different species, and the subtractor is a repeat-free
chromosome-specific library from another species.

15. The method according to any one of claims 1 to 12, wherein the source
DNA is a repeat-free chromosome-specific library from one species, and
the subtractor comprises repeat-free DNA from another species or DNA
sequences shared between two different species.

16. The method according to any one of claims 1 to 12, wherein the source
DNA comprises repeat-free DNA from one particular tissue and/or
developmental stage, and the subtractor comprises repeat-free DNA of a
defined function from another particular tissue and/or developmental
stage of the same organism.

17. The method according to any one of claims 1 to 16, wherein compounds
contained in the matrix material used in step (c) are immobilized to the
matrix material.

18. The method according to claim 17, wherein the matrix material contains
streptavidin.

19. The method according to claim 17 or 18, wherein the matrix material is
streptavidin-conjugated magnetic beads.

20. The method according to claim 1, wherein the source DNA contains
thymidine analogues, the subtractor contains anti-thymidine analogue
antibodies, and the matrix material contains compounds capable of
binding to the anti-thymidine analogue antibodies.

21. The method according to claim 1, wherein the source DNA contains
BrdU, the subtractor contains anti-BrdU antibodies, and the matrix
material contains compounds capable of binding to the anti-BrdU antibodies.



22. The method according to any one of claims 1 to 21, wherein prior to step
(a), the source DNA is subjected to PCR.

23. The method according to claim 22, wherein the PCR is DOP-PCR.

24. The method according to any one of claims 1 to 23, wherein prior to step
(a), the DOP-PCR-amplified source DNA is subjected to a further round
of PCR with a primer whose 5'-portion comprises a sequence of
nucleotides not present in the DOP primer used in claim 23, and whose
3'-portion consists of a number of the non-random nucleotides from the
5'-end of the DOP-primer used in claim 23.

25. The method according to any one of claims 22 to 24, wherein the
DOP-PCR primer has the sequence

5'-CCG ACT CGA GNN NNN NAT GTG G-3'
wherein N may be any nucleotide.

26. The method according to claim 24 or 25, wherein the second primer has
the sequence

5'-CTA CTA CTA CTA CCG ACT CGA G-3', or
5'-TGA TCA CGC TAC CCG ACT CGA G-3'.

27. The method according to any one of claims 1 to 26, wherein after step
(d) the recovered subset is subjected to PCR.

28. The method according to any one of claims 1 to 27, wherein steps (a) to
(d) are repeated at least once using each recovered subset obtained in
step (d) as source DNA in step (a).

29. Use of the subset obtained according to the method of any one of claims
1 to 28 as probes for DNA-DNA hybridisations.




30. A DNA probe obtainable according to any one of claims 1 to 28,
containing less than 2% Cot-1 DNA.